125 research outputs found

    Predictive Top-Down Integration of Prior Knowledge during Speech Perception

    A striking feature of human perception is that our subjective experience depends not only on sensory information from the environment but also on our prior knowledge or expectations. The precise mechanisms by which sensory information and prior knowledge are integrated remain unclear, with longstanding disagreement concerning whether integration is strictly feedforward or whether higher-level knowledge influences sensory processing through feedback connections. Here we used concurrent EEG and MEG recordings to determine how sensory information and prior knowledge are integrated in the brain during speech perception. We manipulated listeners' prior knowledge of speech content by presenting matching, mismatching, or neutral written text before a degraded (noise-vocoded) spoken word. When speech conformed to prior knowledge, subjective perceptual clarity was enhanced. This enhancement in clarity was associated with a spatiotemporal profile of brain activity uniquely consistent with a feedback process: activity in the inferior frontal gyrus was modulated by prior knowledge before activity in lower-level sensory regions of the superior temporal gyrus. In parallel, we parametrically varied the level of speech degradation, and therefore the amount of sensory detail, so that changes in neural responses attributable to sensory information and prior knowledge could be directly compared. Although sensory detail and prior knowledge both enhanced speech clarity, they had an opposite influence on the evoked response in the superior temporal gyrus. We argue that these data are best explained within the framework of predictive coding in which sensory activity is compared with top-down predictions and only unexplained activity propagated through the cortical hierarchy

    An information theoretic characterisation of auditory encoding.

    The entropy metric derived from information theory provides a means to quantify the amount of information transmitted in acoustic streams like speech or music. By systematically varying the entropy of pitch sequences, we sought brain areas where neural activity and energetic demands increase as a function of entropy. Such a relationship is predicted to occur in an efficient encoding mechanism that uses less computational resource when less information is present in the signal: we specifically tested the hypothesis that such a relationship is present in the planum temporale (PT). In two convergent functional MRI studies, we demonstrated this relationship in PT for encoding, while furthermore showing that a distributed fronto-parietal network for retrieval of acoustic information is independent of entropy. The results establish PT as an efficient neural engine that demands less computational resource to encode redundant signals than those with high information content
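    As a rough illustration of the entropy metric mentioned in this abstract, the sketch below computes the zeroth-order Shannon entropy (in bits per symbol) of a discrete pitch sequence. The function name `shannon_entropy` and the two example sequences are hypothetical, and the study itself may have defined and manipulated entropy differently; this only shows that a repetitive sequence carries less information than a varied one.

```python
import math
from collections import Counter


def shannon_entropy(sequence):
    """Zeroth-order Shannon entropy of a discrete sequence, in bits per symbol.

    Each distinct symbol's probability is estimated from its relative
    frequency in the sequence (an assumption made for this sketch).
    """
    counts = Counter(sequence)
    total = len(sequence)
    # H = sum_i p_i * log2(1 / p_i), with p_i = n_i / total
    return sum((n / total) * math.log2(total / n) for n in counts.values())


# Hypothetical pitch sequences (MIDI note numbers), for illustration only.
redundant = [60, 60, 60, 60, 60, 60, 60, 60]  # one pitch repeated
varied = [60, 62, 64, 65, 67, 69, 71, 72]     # eight equiprobable pitches

print(shannon_entropy(redundant))  # 0.0 bits: fully predictable
print(shannon_entropy(varied))     # 3.0 bits: log2(8)
```

    Under the efficient-encoding hypothesis described above, a sequence like `varied` would be expected to demand more encoding resource than one like `redundant`.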

    Attentional Modulation of Envelope-Following Responses at Lower (93–109 Hz) but Not Higher (217–233 Hz) Modulation Rates

    Directing attention to sounds of different frequencies allows listeners to perceive a sound of interest, like a talker, in a mixture. Whether cortically generated frequency-specific attention affects responses as low as the auditory brainstem is currently unclear. Participants attended to either a high- or low-frequency tone stream, which was presented simultaneously and tagged with different amplitude modulation (AM) rates. In a replication design, we showed that envelope-following responses (EFRs) were modulated by attention only when the stimulus AM rate was slow enough for the auditory cortex to track, and not for stimuli with faster AM rates, which are thought to reflect ‘purer’ brainstem sources. Thus, we found no evidence of frequency-specific attentional modulation that can be confidently attributed to brainstem generators. The results demonstrate that different neural populations contribute to EFRs at higher and lower rates, compatible with cortical contributions at lower rates. The results further demonstrate that stimulus AM rate can alter conclusions of EFR studies. This work was supported by funding from the Canadian Institutes of Health Research (CIHR; Operating Grant: MOP 133450) and the Natural Sciences and Engineering Research Council of Canada (NSERC; Discovery Grant: 327429-2012). Authors R.P. Carlyon and H.E. Gockel were supported by intramural funding from the Medical Research Council [SUAG/007 RG91365].

    Effect of Chronic Stimulation and Stimulus Level on Temporal Processing by Cochlear Implant Listeners

    A series of experiments investigated potential changes in temporal processing during the months following activation of a cochlear implant (CI) and as a function of stimulus level. Experiment 1 tested patients on the day of implant activation and 2 and 6 months later. All stimuli were presented using direct stimulation of a single apical electrode. The dependent variables were rate discrimination ratios (RDRs) for pulse trains with rates centred on 120 pulses per second (pps), obtained using an adaptive procedure, and a measure of the upper limit of temporal pitch, obtained using a pitch-ranking procedure. All stimuli were presented at their most comfortable level (MCL). RDRs decreased from 1.23 to 1.16 and the upper limit increased from 357 to 485 pps from 0 to 2 months post-activation, with no overall change from 2 to 6 months. Because MCLs and hence the testing level increased across sessions, two further experiments investigated whether the performance changes observed across sessions could be due to level differences. Experiment 2 re-tested a subset of subjects at 9 months post-activation, using current levels similar to those used at 0 months. Although the stimuli sounded softer, some subjects showed lower RDRs and/or higher upper limits at this re-test. Experiment 3 measured RDRs and the upper limit for a separate group of subjects at levels equal to 60 %, 80 % and 100 % of the dynamic range. RDRs decreased with increasing level. The upper limit increased with increasing level for most subjects, with two notable exceptions. Implications of the results for temporal plasticity are discussed, along with possible influences of the effects of level and of across-session learning

    The Effect of Visual Cues on Auditory Stream Segregation in Musicians and Non-Musicians

    Background: The ability to separate two interleaved melodies is an important factor in music appreciation. This ability is greatly reduced in people with hearing impairment, contributing to difficulties in music appreciation. The aim of this study was to assess whether visual cues, musical training or musical context could have an effect on this ability, and potentially improve music appreciation for the hearing impaired. Methods: Musicians (N = 18) and non-musicians (N = 19) were asked to rate the difficulty of segregating a four-note repeating melody from interleaved random distracter notes. Visual cues were provided on half the blocks, and two musical contexts were tested, with the overlap between melody and distracter notes either gradually increasing or decreasing. Conclusions: Visual cues, musical training, and musical context all affected the difficulty of extracting the melody from a background of interleaved random distracter notes. Visual cues were effective in reducing the difficulty of segregating the melody from distracter notes, even in individuals with no musical training. These results are consistent with theories that indicate an important role for central (top-down) processes in auditory streaming mechanisms, and suggest that visual cue

    Evaluation of Possible Effects of a Potassium Channel Modulator on Temporal Processing by Cochlear Implant Listeners

    Temporal processing by cochlear implant listeners is degraded and is affected by auditory deprivation. The fast-acting Kv3.1 potassium channel is important for sustained temporally accurate firing and is also susceptible to deprivation, the effects of which can be partially restored in animals by the molecule AUT00063. We report the results of a randomised placebo-controlled double-blind study on psychophysical tests of the effects of AUT00063 on temporal processing by CI listeners. The study measured the upper limit of temporal pitch, gap detection, and discrimination of low rates (centred on 120 pps) for monopolar pulse trains presented to an apical electrode. The upper limit was measured using the optimally efficient midpoint comparison (MPC) pitch-ranking procedure; thresholds were obtained for the other two measures using an adaptive procedure. Twelve CI users (MedEl and Cochlear) were tested before and after two periods of AUT00063 or placebo in a within-subject crossover study. No significant differences occurred between post-drug and post-placebo conditions. This absence of effect occurred despite high test-retest reliability for all three measures, obtained by comparing performance on the two baseline visits, and despite the demonstrated sensitivity of the measures to modest changes in temporal processing obtained in other studies from our laboratory. Hence, we have no evidence that AUT00063 improves temporal processing for the doses and patient population employed

    Understanding Pitch Perception as a Hierarchical Process with Top-Down Modulation

    Pitch is one of the most important features of natural sounds, underlying the perception of melody in music and prosody in speech. However, the temporal dynamics of pitch processing are still poorly understood. Previous studies suggest that the auditory system uses a wide range of time scales to integrate pitch-related information and that the effective integration time is both task- and stimulus-dependent. None of the existing models of pitch processing can account for such task- and stimulus-dependent variations in processing time scales. This study presents an idealized neurocomputational model, which provides a unified account of the multiple time scales observed in pitch perception. The model is evaluated using a range of perceptual studies, which have not previously been accounted for by a single model, and new results from a neurophysiological experiment. In contrast to other approaches, the current model contains a hierarchy of integration stages and uses feedback to adapt the effective time scales of processing at each stage in response to changes in the input stimulus. The model has features in common with a hierarchical generative process and suggests a key role for efferent connections from central to sub-cortical areas in controlling the temporal dynamics of pitch processing

    Pitch Comparisons between Electrical Stimulation of a Cochlear Implant and Acoustic Stimuli Presented to a Normal-hearing Contralateral Ear

    Four cochlear implant users, having normal hearing in the unimplanted ear, compared the pitches of electrical and acoustic stimuli presented to the two ears. Comparisons were between 1,031-pps pulse trains and pure tones or between 12 and 25-pps electric pulse trains and bandpass-filtered acoustic pulse trains of the same rate. Three methods—pitch adjustment, constant stimuli, and interleaved adaptive procedures—were used. For all methods, we showed that the results can be strongly influenced by non-sensory biases arising from the range of acoustic stimuli presented, and proposed a series of checks that should be made to alert the experimenter to those biases. We then showed that the results of comparisons that survived these checks do not deviate consistently from the predictions of a widely-used cochlear frequency-to-place formula or of a computational cochlear model. We also demonstrate that substantial range effects occur with other widely used experimental methods, even for normal-hearing listeners

    Across-Channel Timing Differences as a Potential Code for the Frequency of Pure Tones

    When a pure tone or low-numbered harmonic is presented to a listener, the resulting travelling wave in the cochlea slows down at the portion of the basilar membrane (BM) tuned to the input frequency due to the filtering properties of the BM. This slowing is reflected in the phase of the response of neurons across the auditory nerve (AN) array. It has been suggested that the auditory system exploits these across-channel timing differences to encode the pitch of both pure tones and resolved harmonics in complex tones. Here, we report a quantitative analysis of previously published data on the response of guinea pig AN fibres, of a range of characteristic frequencies, to pure tones of different frequencies and levels. We conclude that although the use of across-channel timing cues provides an a priori attractive and plausible means of encoding pitch, many of the most obvious metrics for using that cue produce pitch estimates that are strongly influenced by the overall level and therefore are unlikely to provide a straightforward means for encoding the pitch of pure tones